SEQUENCE LISTING 

4 (1) GENERAL INFORMATION: 

6 (i) APPLICANT: Kirschbaum, Bernd 

7 Berglund, Erick 

8 Meisterernst, Michael 



9 Polites, Greg 

11 (ii) TITLE OF INVENTION: PURIFICATION OF HIGHER ORDER TRANSCRIPTION 

12 COMPLEXES FROM TRANSGENIC 

13 NON-HUMAN ANIMALS 
15 (iii) NUMBER OF SEQUENCES: 17 

17 (iv) CORRESPONDENCE ADDRESS: 

18 (A) ADDRESSEE: HELLER, EHRMAN, WHITE & McAULIFFE 

19 (B) STREET: 1666 K Street, N.W., Suite 300 

20 (C) CITY: Washington 

21 (D) STATE: D.C. 

22 (E) COUNTRY: USA 

23 (F) ZIP: 20006 

25 (v) COMPUTER READABLE FORM: 

26 (A) MEDIUM TYPE: Floppy disk 

27 (B) COMPUTER: IBM PC compatible 

28 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

29 (D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

31 (vi) CURRENT APPLICATION DATA: 

32 (A) APPLICATION NUMBER: US/09/849,243 

33 (B) FILING DATE: 07-May-2001 

35 (viii) ATTORNEY/AGENT INFORMATION: 

36 (A) NAME: Granados, Patricia D. 

37 (B) REGISTRATION NUMBER: 33,683 

38 (C) REFERENCE/DOCKET NUMBER: 38005-0148 
40 (ix) TELECOMMUNICATION INFORMATION: 

41 (A) TELEPHONE: (202)912-2000 

42 (B) TELEFAX: (202)912-2020 
47 (2) INFORMATION FOR SEQ ID NO: 1: 

49 (i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 12 amino acids 

51 (B) TYPE: amino acid 

52 (D) TOPOLOGY: linear 
54 (ii) MOLECULE TYPE: peptide 

58 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

60 Met Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Val 

61 1 5 10 
64 (2) INFORMATION FOR SEQ ID NO: 2: 

66 (i) SEQUENCE CHARACTERISTICS: 

67 (A) LENGTH: 11 amino acids 

68 (B) TYPE: amino acid 

69 (D) TOPOLOGY: linear 
71 (ii) MOLECULE TYPE: peptide 
75 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

77 Met Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 

78 1 5 10 
81 (2) INFORMATION FOR SEQ ID NO: 3: 

83 (i) SEQUENCE CHARACTERISTICS: 

84 (A) LENGTH: 10 amino acids 

85 (B) TYPE: amino acid 
86 (D) TOPOLOGY: linear 
88 (ii) MOLECULE TYPE: peptide 

92 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



94 Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 

95 1 5 10 
98 (2) INFORMATION FOR SEQ ID NO: 4: 

100 (i) SEQUENCE CHARACTERISTICS: 

101 (A) LENGTH: 9 amino acids 

102 (B) TYPE: amino acid 

103 (D) TOPOLOGY: linear 
105 (ii) MOLECULE TYPE: peptide 

109 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

111 Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 

112 1 5 

115 (2) INFORMATION FOR SEQ ID NO: 5: 

117 (i) SEQUENCE CHARACTERISTICS: 

118 (A) LENGTH: 22 base pairs 

119 (B) TYPE: nucleic acid 

120 (C) STRANDEDNESS: single 

121 (D) TOPOLOGY: linear 
123 (ii) MOLECULE TYPE: cDNA 

126 (ix) FEATURE: 

127 (A) NAME/KEY: exon 

128 (B) LOCATION: 1..22 

132 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

134 GGAGCAACCG CCTGCTGGGT GC 22 
137 (2) INFORMATION FOR SEQ ID NO: 6: 

139 (i) SEQUENCE CHARACTERISTICS: 

140 (A) LENGTH: 21 base pairs 

141 (B) TYPE: nucleic acid 

142 (C) STRANDEDNESS: single 

143 (D) TOPOLOGY: linear 
145 (ii) MOLECULE TYPE: cDNA 
148 (ix) FEATURE: 

149 (A) NAME/KEY: exon 

150 (B) LOCATION: 1..21 

153 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

155 CCTGTGTTGC CTGCTGGGAC G 21 
158 (2) INFORMATION FOR SEQ ID NO: 7: 

160 (i) SEQUENCE CHARACTERISTICS: 

161 (A) LENGTH: 21 base pairs 

162 (B) TYPE: nucleic acid 

163 (C) STRANDEDNESS: single 

164 (D) TOPOLOGY: linear 
166 (ii) MOLECULE TYPE: cDNA 

169 (ix) FEATURE: 

170 (A) NAME/KEY: exon 

171 (B) LOCATION: 1..21 

174 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

176 GGAGACTGAA GTTAGGCCAG C 21 
179 (2) INFORMATION FOR SEQ ID NO: 8: 

181 (i) SEQUENCE CHARACTERISTICS: 

182 (A) LENGTH: 76 base pairs 

183 (B) TYPE: nucleic acid 

184 (C) STRANDEDNESS: single 

185 (D) TOPOLOGY: linear 
187 (ii) MOLECULE TYPE: cDNA 

190 (ix) FEATURE: 

191 (A) NAME/KEY: exon 

192 (B) LOCATION: 1..76 

196 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

198 GCGGCACCAG GCCGCTGCTG TGATGATGAT GATGATGGCT GCTGCCCATG ACTGCGTAAT 60 

200 GCGGTCATGA CGCTTT 76 
203 (2) INFORMATION FOR SEQ ID NO: 9: 

205 (i) SEQUENCE CHARACTERISTICS: 

206 (A) LENGTH: 75 base pairs 

207 (B) TYPE: nucleic acid 

208 (C) STRANDEDNESS: single 

209 (D) TOPOLOGY: linear 
211 (ii) MOLECULE TYPE: cDNA 

214 (ix) FEATURE: 

215 (A) NAME/KEY: exon 

216 (B) LOCATION: 1..75 

219 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

221 GAAGGGGGTG GGGGAGGCAA GGGTACATGA GAGCCATTAC GTCGTCTTCC TGAATCCCTT 60 

223 TAGCCGCTTT GCTCG 75 
226 (2) INFORMATION FOR SEQ ID NO: 10: 

228 (i) SEQUENCE CHARACTERISTICS: 

229 (A) LENGTH: 22 base pairs 

230 (B) TYPE: nucleic acid 

231 (C) STRANDEDNESS: single 

232 (D) TOPOLOGY: linear 
234 (ii) MOLECULE TYPE: cDNA 

237 (ix) FEATURE: 

238 (A) NAME/KEY: exon 

239 (B) LOCATION: 1..22 

242 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

244 CCCTATGACG TCCCGGATTA CG 22 
247 (2) INFORMATION FOR SEQ ID NO: 11: 

249 (i) SEQUENCE CHARACTERISTICS: 

250 (A) LENGTH: 22 base pairs 

251 (B) TYPE: nucleic acid 

252 (C) STRANDEDNESS: single 

253 (D) TOPOLOGY: linear 
255 (ii) MOLECULE TYPE: cDNA 

258 (ix) FEATURE: 

259 (A) NAME/KEY: exon 

260 (B) LOCATION: 1..22 

263 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

265 GTGGAGTGGT GCCCGGCAAG GG 22 
268 (2) INFORMATION FOR SEQ ID NO: 12: 

270 (i) SEQUENCE CHARACTERISTICS: 

271 (A) LENGTH: 19 amino acids 

272 (B) TYPE: amino acid 

273 (D) TOPOLOGY: linear 
275 (ii) MOLECULE TYPE: peptide 

279 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

281 Met Gly Ser Ser His His His His His His Ser Ser Gly Leu Val Pro 

282 1 5 10 15 

284 Arg Gly Cys 
288 (2) INFORMATION FOR SEQ ID NO: 13: 

290 (i) SEQUENCE CHARACTERISTICS: 

291 (A) LENGTH: 1310 base pairs 

292 (B) TYPE: nucleic acid 

293 (C) STRANDEDNESS: single 

294 (D) TOPOLOGY: linear 
296 (ii) MOLECULE TYPE: cDNA 

299 (ix) FEATURE: 

300 (A) NAME/KEY: exon 

301 (B) LOCATION: 1..1310 

304 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

306 CCATGGGCTA TCCCTATGAC GTCCCGGATT AGGCAGTCAT GGGCAGCAGC CATCATCATC 60 

308 ATCATCACAG CAGCGGCCTG GTGCCGCGCG GCAGCCATAT GGATCAGAAC AACAGCCTGC 120 

310 CACCTTACGC TCAGGGCTTG GCCTCCCCTC AGGGTGCCAT GACTCCCGGA ATCCCTATCT 180 

312 TTAGTCCAAT GATGCCTTAT GGCACTGGAC TGACCCCACA GCCTATTCAG AACACCAATA 240 

314 GTCTGTCTAT TTTGGAAGAG CAACAAAGGC AGCAGCAGCA ACAACAACAG CAGCAGCAGC 300 

316 AGCAGCAGCA GCAGCAACAG CAACAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC 360 

318 AGCAGCAGCA GCAGCAGCAA CAGGCAGTGG CAGCTGCAGC CGTTCAGCAG TCAACGTCCC 420 

320 AGCAGGCAAC ACAGGGAACC TCAGGCCAGG CACCACAGCT CTTCCACTCA CAGACTCTCA 480 

322 CAACTGCACC CTTGCCGGGC ACCACTCCAC TGTATCCCTC CCCCATGACT CCCATGACCC 540 

324 CCATCACTCC TGCCACGCCA GCTTCGGAGA GTTCTGGGAT TGTACCGCAG CTGCAAAATA 600 

326 TTGTATCCAC AGTGAATCTT GGTTGTAAAC TTGACCTAAA GACCATTGCA CTTCGTGCCC 660 

328 GAAACGCCGA ATATAATCCC AAGCGGTTTG CTGCGGTAAT CATGAGGATA AGAGAGCCAC 720 

330 GAACCACGGC ACTGATTTTC AGTTCTGGGA AAATGGTGTG CACAGGAGCC AAGAGTGAAG 780 

332 AACAGTCCAG ACTGGCAGCA AGAAAATATG CTAGAGTTGT ACAGAAGTTG GGTTTTCCAG 840 

334 CTAAGTTCTT GGACTTCAAG ATTCAGAACA TGGTGGGGAG CTGTGATGTG AAGTTTCCTA 900 

336 TAAGGTTAGA AGGCCTTGTG CTCACCCACC AACAATTTAG TAGTTATGAG CCAGAGTTAT 960 

338 TTCCTGGTTT AATCTACAGA ATGATCAAAC CCAGAATTGT TCTCCTTATT TTTGTTTCTG 1020 

340 GAAAAGTTGT ATTAACAGGT GCTSAAGTCA GAGCAGAAAT TTATGAAGCA TTTGAAAACA 1080 

342 TCTACCCTAT TCTAAAGGGA TTCAGGAAGA CGACGTAATG GCTCTCATGT ACCCTTGCCT 1140 

344 CCCCCACCCC CTTCTTTTTT TTTTTTTAAA CAAATCAGTT TGTTTTGGTA CCTTTAAATG 1200 

346 GTGGTGTTGT GAGAAGATGG ATGTTGAGTT GCAGGGTGTG GCACCAGGTG ATGCCCTTCT 1260 

348 GTAAGTGCCC CTTCCGGCAT CCCGGAATTC CTGCAGCCCA ACGCGGCCGC 1310 
351 (2) INFORMATION FOR SEQ ID NO: 14: 



353 (i) SEQUENCE CHARACTERISTICS: 

354 (A) LENGTH: 4286 base pairs 

355 (B) TYPE: nucleic acid 

356 (C) STRANDEDNESS: single 

357 (D) TOPOLOGY: linear 

359 (ii) MOLECULE TYPE: cDNA 

362 (ix) FEATURE: 

363 (A) NAME/KEY: exon 

364 (B) LOCATION: 1..4286 

367 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



369 GAATTCCCCT GCAGGTCACT TAGCGTTGGC CACATAGTAG GTTCTCAAAT ACTTGTTAAT 60 

371 AAATAAGTTT GTTCGAGAAG CTGGGCAATG ATATTCTACA GCTGGAAGAA GAAACATAAT 120 

373 GATCTAGTAA TTAGCTCAAT TAAAAATAAA CGTTCTTCTT TCCTCAGAGG AGCATTTCCC 180 

375 AAGGCCTGCC TTGATAGCCA TCCAAAAAGG CCAAGCTCAT CCAATCTTGC CCTAGATTTA 240 

377 TGCTAAAATG CAGTTACAAT CGATAGGATG ACAGAAAACG ACAGCACTTA TTTAAATATA 300 

379 ATAGGCACTT ATTTAAATAG GAGAAGCTGT GACTTCATAG CAAGTGTTGG GGTTAGGA7U\ 360 
381 CTGGGTGGAT AAACTTGCTG ATGCTGTAGA TCTTAGCCTC TACATGAGAT CATGTGGAAA 420 
383 ATCTGAAAGC ATTTTAGGTT CCTTATGTTT GCAATCAAAT AACTGTACAC CTTTTAATTT 480 
385 AAAAAGTACC ATGAGGCACA CACACACACT CGCAGGAACT TTTTGGCGTA ACAAAACTAG 540 
387 AATTAGATCT AAAAGCTAAC TGTAGGACTG AGTCTATTCT AAACTGAAAG CCTGGACATC 600 

389 TGGAGTACCA GGGGGAGATG ACGTGTTACG GGCTTCCATA AAAGCAGCTG GCTTTGTUITG 660 
391 GAAGGAGCCA AGAGGCCAGC ACAGGAGCGG ATTCGTCGCT TTCACGGCCA TCGAGCCGAA 720 
393 CCTCTCGCAA GTCCGTGAGC CGTTAAGGAG GCCCCCAGTC CCGACCCTTC GCCCCAAGCC 780 
395 CCTCGGGGTC CCCGGGCCTG GTACTCCTTG CCACACGGGA GGGGCGCGGA AGCCGGGGCG 840 
397 GAGGAGGAGC CAACCCCGGG CTGGGCTGAG ACCCGCAGAG GAAGACGCTC TAGGGATTTG 900 
399 TCCCGGACTA GCGAGATGGC AAGGCTGAGG ACGGGAGGCT GATTGAGAGG CGAAGGTACA 960 
401 CCCTAATCTC AATACAACCT TTGGAGCTAA GCCAGCAATG GTAGAGGGAA GATTCTGCAC 1020 
403 GTCCCTTCCA GGCGGCCTCC CCGTCACCAC CCCCCCCAAC CCGCCCCGAC CGGAGCTGAG 1080 
405 AGTAATTCAT ACAAAAGGAC TCGCCCCTGC CTTGGGGAAT CCCAGGGACC GTCGTTAAAC 1140 
407 TCCCACTAAC GTAGAACCCA GAGATCGCTG CGTTCCCGCC CCCTCACCCG CCCGCTCTCG 1200 
409 TCATCACTGA GGTGGAGAAG AGCATGCGTG AGGCTCCGGT GCCCGTCAGT GGGCAGAGCG 1260 
411 CACATCGCCC ACAGTCCCCG AGAAGTTGGG GGGAGGGGTC GGCAATTGAA CCGGTGCCTA 1320 
413 GAGAAGGTGG CGCGGGGTAA ACTGGGAAAG TGATGTCGTG TACTGGCTCC GCCTTTTTCC 1380 
415 CGAGGGTGGG GGAGAACCGT ATATAAGTGC AGTAGTCGCC GTGAACGTTC TTTTTCGCAA 1440 
417 CGGGTTTGCC GCCAGAACAC AGGTAAGTGC CGTGTGTGGT TCCCGCGGGC CTGGCCTCTT 1500 
419 TACGGGTTAT GGCCCTTGCG TGCCTTGAAT TACTTCCACG CCCCTGGCTG CAGTACGTGA 1560 
421 TTCTTGATCC CGAGCTTCGG GTTGGAAGTG GGTGGGAGAG TTCGAGGCCT TGCGCTTAAG 1620 
423 GAGCCCCTTC GCCTCGTGCT TGAGTTGAGG CCTGGCCTGG GCGCTGGGGC CGCCGCGTGC 1680 
425 GAATCTGGTG GCACCTTCGC GCCTGTCTCG CTGCTTTCGA TAAGTCTCTA GCCATTTAAA 1740 
427 ATTTTTGATG ACCTGCTGCG ACGCTTTTTT TCTGGCAAGA TAGTCTTGTA AATGCGGGCC 1800 
429 AAGATCTGCA CACTGGTATT TCGGTTTTTG GGGCCGCGGG CGGCGACGGG GCCCGTGCGT 1860 
431 CCCAGCGCAC ATGTTCGGCG AGGCGGGGCC TGCGAGCGCG GCCACCGAGA ATCGGACGGG 1920 
433 GGTAGTCTCA AGCTGGCCGG CCTGCTCTGG TGCCTGGCCT CGCGCCGCCG TGTATCGCCC 1980 
435 CGCCCTGGGC GGCAAGGCTG GCCCGGTCGG CACCAGTTGC GTGAGCGGAA AGATGGCCGC 2040 
437 TTCCCGGCCC TGCTGCAGGG AGCTCAAAAT GGAGGACGCG GCGCTCGGGA GAGCGGGCGG 2100 